SEQUENCE LISTING 

<110> ITOH, Kyogo
SHICHIJO, Shigeki
IMAI, Yasuhisa

<120> TUMOR ANTIGEN PROTEINS, GENES THEREFOR, AND TUMOR
ANTIGEN PEPTIDES

<130> 0020-4491P

<140> 09/202,047
<141> 1998-12-07

<160> 2

<170> PatentIn Ver. 2.1

<210> 1
<211> 2527
<212> DNA
<213> Homo sapiens

<220>
<221> 5'UTR
<222> (1)..(38)

<220>
<221> CDS
<222> (39)..(2438)

<220>
<221> 3'UTR
<222> (2439)..(2506)

<220>
<221> polyA_site
<222> (2507)..(2527)

<400> 1

ggttcggcgg cagccgggct cggagtggac gtgccact atg ggg tcg tcc aag aag 56
                                          Met Gly Ser Ser Lys Lys
                                          1               5

cat cgc gga gag aag gag gcg gcc ggg acg acg gcg gcg gcc ggc acc 104
His Arg Gly Glu Lys Glu Ala Ala Gly Thr Thr Ala Ala Ala Gly Thr
            10                  15                  20

ggg ggt gcc acc gag cag ccg ccg cgg cac cgg gaa cac aaa aaa cac 152
Gly Gly Ala Thr Glu Gln Pro Pro Arg His Arg Glu His Lys Lys His
        25                  30                  35

aag cac cgg agt ggc ggc agt ggc ggt agc ggt ggc gaa cga cgg aag 200
Lys His Arg Ser Gly Gly Ser Gly Gly Ser Gly Gly Glu Arg Arg Lys
    40                  45                  50

cgg agc cgg gaa cgt ggg ggc gag cgc ggg agc ggg cgg cgc ggg gcc 248
Arg Ser Arg Glu Arg Gly Gly Glu Arg Gly Ser Gly Arg Arg Gly Ala
55                  60                  65                  70

gaa gct gag gcc cgg agc agc acg cac ggg cgg gag cgc agc cag gca 296
Glu Ala Glu Ala Arg Ser Ser Thr His Gly Arg Glu Arg Ser Gln Ala
                75                  80                  85

gag ccc tcc gag cgg cgc gtg aag cgg gag aag cgc gat gac ggc tac 344
Glu Pro Ser Glu Arg Arg Val Lys Arg Glu Lys Arg Asp Asp Gly Tyr
            90                  95                  100

gag gcc gct gcc agc tcc aaa act agc tca ggc gat gcc tcc tca ctc 392
Glu Ala Ala Ala Ser Ser Lys Thr Ser Ser Gly Asp Ala Ser Ser Leu
        105                 110                 115

agc atc gag gag act aac aaa ctc cgg gca aag ttg ggg ctg aaa ccc 440
Ser Ile Glu Glu Thr Asn Lys Leu Arg Ala Lys Leu Gly Leu Lys Pro
    120                 125                 130

ttg gag gtt aat gcc atc aag aag gag gcg ggc acc aag gag gag ccc 488
Leu Glu Val Asn Ala Ile Lys Lys Glu Ala Gly Thr Lys Glu Glu Pro
135                 140                 145                 150

gtg aca gct gat gtc atc aac cct atg gcc ttg cga cag cga gag gag 536
Val Thr Ala Asp Val Ile Asn Pro Met Ala Leu Arg Gln Arg Glu Glu
                155                 160                 165



ctg cgg gag aag ctg gcg gct gcc aag gag aag cgc ctg ctg aac caa 584
Leu Arg Glu Lys Leu Ala Ala Ala Lys Glu Lys Arg Leu Leu Asn Gln
            170                 175                 180

aag ctg ggg aag ata aag acc cta gga gag gat gac ccc tgg ctg gac 632
Lys Leu Gly Lys Ile Lys Thr Leu Gly Glu Asp Asp Pro Trp Leu Asp
        185                 190                 195

gac act gca gcc tgg atc gag agg agc cgg cag ctg cag aag gag aag 680
Asp Thr Ala Ala Trp Ile Glu Arg Ser Arg Gln Leu Gln Lys Glu Lys
    200                 205                 210



gac ctg gca gag aag agg gcc aag tta ctg gag gag atg gac caa gag 728
Asp Leu Ala Glu Lys Arg Ala Lys Leu Leu Glu Glu Met Asp Gln Glu
215                 220                 225                 230

ttt ggt gtc agc act ctg gtg gag gag gag ttc ggg cag agg cgg cag 776
Phe Gly Val Ser Thr Leu Val Glu Glu Glu Phe Gly Gln Arg Arg Gln
                235                 240                 245

gac ctg tac agt gcc cgg gac ctg cag ggc ctc acc gtg gag cat gcc 824
Asp Leu Tyr Ser Ala Arg Asp Leu Gln Gly Leu Thr Val Glu His Ala
            250                 255                 260

att gat tcc ttc cga gaa ggg gag aca atg att ctt acc ctc aag gac 872
Ile Asp Ser Phe Arg Glu Gly Glu Thr Met Ile Leu Thr Leu Lys Asp
        265                 270                 275

aaa ggc gtg ctg cag gag gag gag gac gtg ctg gtg aac gtg aac ctg 920
Lys Gly Val Leu Gln Glu Glu Glu Asp Val Leu Val Asn Val Asn Leu
    280                 285                 290

gtg gat aag gag cgg gca gag aaa aat gtg gag ctg cgg aag aag aag 968
Val Asp Lys Glu Arg Ala Glu Lys Asn Val Glu Leu Arg Lys Lys Lys
295                 300                 305                 310

cct gac tac ctg ccc tat gcc gag gac gag agc gtg gac gac ctg gcg 1016
Pro Asp Tyr Leu Pro Tyr Ala Glu Asp Glu Ser Val Asp Asp Leu Ala
                315                 320                 325



cag caa aaa cct cgc tct atc ctg tcc aag tat gac gaa gag ctt gaa 1064
Gln Gln Lys Pro Arg Ser Ile Leu Ser Lys Tyr Asp Glu Glu Leu Glu
            330                 335                 340

ggg gag cgg cca cat tcc ttc cgc ttg gag cag ggc ggc acg gct gat 1112
Gly Glu Arg Pro His Ser Phe Arg Leu Glu Gln Gly Gly Thr Ala Asp
        345                 350                 355

ggc ctg cgg gag cgg gag ctg gag gag atc cgg gcc aag ctg cgg ctg 1160
Gly Leu Arg Glu Arg Glu Leu Glu Glu Ile Arg Ala Lys Leu Arg Leu
    360                 365                 370

cag gct cag tcc ctg agc aca gtg ggg ccc cgg ctg gcc tcc gaa tac 1208
Gln Ala Gln Ser Leu Ser Thr Val Gly Pro Arg Leu Ala Ser Glu Tyr
375                 380                 385                 390

ctc acg cct gag gag atg gtg acc ttt aaa aag acc aag cgg agg gtg 1256
Leu Thr Pro Glu Glu Met Val Thr Phe Lys Lys Thr Lys Arg Arg Val
                395                 400                 405

aag aaa atc cgc aag aag gag aag gag gta gta gtg cgg gca gat gac 1304
Lys Lys Ile Arg Lys Lys Glu Lys Glu Val Val Val Arg Ala Asp Asp
            410                 415                 420

ttg ctg cct ctc ggg gac cag act cag gat ggg gac ttt ggt tcc aga 1352
Leu Leu Pro Leu Gly Asp Gln Thr Gln Asp Gly Asp Phe Gly Ser Arg
        425                 430                 435

ctg cgg gga cgg ggt cgc cgc cga gtg tcc gaa gtg gag gag gag aag 1400
Leu Arg Gly Arg Gly Arg Arg Arg Val Ser Glu Val Glu Glu Glu Lys
    440                 445                 450

gag cct gtg cct cag ccc ctg ccg tcg gac gac acc cga gtg gag aac 1448
Glu Pro Val Pro Gln Pro Leu Pro Ser Asp Asp Thr Arg Val Glu Asn
455                 460                 465                 470

atg gac atc agt gat gag gag gaa ggt gga gct cca ccg ccg ggg tcc 1496
Met Asp Ile Ser Asp Glu Glu Glu Gly Gly Ala Pro Pro Pro Gly Ser
                475                 480                 485

ccg cag gtg ctg gag gag gac gag gcg gag ctg gag ctg cag aag cag 1544
Pro Gln Val Leu Glu Glu Asp Glu Ala Glu Leu Glu Leu Gln Lys Gln
            490                 495                 500

ctg gag aag gga cgc cgg ctg cga cag tta cag cag cta cag cag ctg 1592
Leu Glu Lys Gly Arg Arg Leu Arg Gln Leu Gln Gln Leu Gln Gln Leu
        505                 510                 515

cga gac agt ggc gag aag gtg gtg gag att gtg aag aag ctg gag tct 1640
Arg Asp Ser Gly Glu Lys Val Val Glu Ile Val Lys Lys Leu Glu Ser
    520                 525                 530

cgc cag cgg ggc tgg gag gag gat gag gat ccc gag cgg aag ggg gcc 1688
Arg Gln Arg Gly Trp Glu Glu Asp Glu Asp Pro Glu Arg Lys Gly Ala
535                 540                 545                 550

atc gtg ttc aac gcc acg tcc gag ttc tgc cgc acc ttg ggg gag atc 1736
Ile Val Phe Asn Ala Thr Ser Glu Phe Cys Arg Thr Leu Gly Glu Ile
                555                 560                 565

ccc acc tac ggg ctg gct ggc aat cgc gag gag cag gag gag ctc atg 1784
Pro Thr Tyr Gly Leu Ala Gly Asn Arg Glu Glu Gln Glu Glu Leu Met
            570                 575                 580

gac ttt gaa cgg gat gag gag cgc tca gcc aac ggt ggc tcc gaa tct 1832
Asp Phe Glu Arg Asp Glu Glu Arg Ser Ala Asn Gly Gly Ser Glu Ser
        585                 590                 595



gac ggg gag gag aac atc ggc tgg agc acg gtg aac ctg gac gag gag 1880
Asp Gly Glu Glu Asn Ile Gly Trp Ser Thr Val Asn Leu Asp Glu Glu
    600                 605                 610

aag cag cag cag gat ttc tct gct tcc tcc acc acc atc ctg gac gag 1928
Lys Gln Gln Gln Asp Phe Ser Ala Ser Ser Thr Thr Ile Leu Asp Glu
615                 620                 625                 630

gaa ccg atc gtg aat agg ggg ctg gca gct gcc ctg ctc ctg tgt cag 1976
Glu Pro Ile Val Asn Arg Gly Leu Ala Ala Ala Leu Leu Leu Cys Gln
                635                 640                 645

aac aaa ggg ctg ctg gag acc aca gtg cag aag gtg gcc cgg gtg aag 2024
Asn Lys Gly Leu Leu Glu Thr Thr Val Gln Lys Val Ala Arg Val Lys
            650                 655                 660

gcc ccc aac aag tcg ctg ccc tca gcc gtg tac tgc atc gag gat aag 2072
Ala Pro Asn Lys Ser Leu Pro Ser Ala Val Tyr Cys Ile Glu Asp Lys
        665                 670                 675

atg gcc atc gat gac aag tac agc cgg agg gag gaa tac cga ggc ttc 2120
Met Ala Ile Asp Asp Lys Tyr Ser Arg Arg Glu Glu Tyr Arg Gly Phe
    680                 685                 690

aca cag gac ttc aag gag aag gac ggc tac aaa ccc gac gtt aag atc 2168
Thr Gln Asp Phe Lys Glu Lys Asp Gly Tyr Lys Pro Asp Val Lys Ile
695                 700                 705                 710

gaa tac gtg gat gag acg ggc cgg aaa ctc aca ccc aag gag gct ttc 2216
Glu Tyr Val Asp Glu Thr Gly Arg Lys Leu Thr Pro Lys Glu Ala Phe
                715                 720                 725

cgg cag ctg tcg cac cgc ttc cat ggc aag ggc tca ggc aag atg aag 2264
Arg Gln Leu Ser His Arg Phe His Gly Lys Gly Ser Gly Lys Met Lys
            730                 735                 740

aca gag cgg cgg atg aag aag ctg gac gag gag gcg ctc ctg aag aag 2312
Thr Glu Arg Arg Met Lys Lys Leu Asp Glu Glu Ala Leu Leu Lys Lys
        745                 750                 755

atg agc tcc agc gac acg ccc ctg ggc acc gtg gcc ctg ctc cag gag 2360
Met Ser Ser Ser Asp Thr Pro Leu Gly Thr Val Ala Leu Leu Gln Glu
    760                 765                 770

aag cag aag gct cag aag acc ccc tac atc gtg ctc agc ggc agc ggc 2408
Lys Gln Lys Ala Gln Lys Thr Pro Tyr Ile Val Leu Ser Gly Ser Gly
775                 780                 785                 790

aag agc atg aac gcg aac acc atc acc aag tgacagcgcc ctcccgtagt 2458
Lys Ser Met Asn Ala Asn Thr Ile Thr Lys
                795                 800

cggccctgcc tcaaccttca tattaaataa agctccctcc ttatttttaa aaaaaaaaaa 2518

aaaaaaaaa 2527



<210> 2 
<211> 800 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Gly Ser Ser Lys Lys His Arg Gly Glu Lys Glu Ala Ala Gly Thr
1               5                   10                  15

Thr Ala Ala Ala Gly Thr Gly Gly Ala Thr Glu Gln Pro Pro Arg His
            20                  25                  30

Arg Glu His Lys Lys His Lys His Arg Ser Gly Gly Ser Gly Gly Ser
        35                  40                  45

Gly Gly Glu Arg Arg Lys Arg Ser Arg Glu Arg Gly Gly Glu Arg Gly
    50                  55                  60

Ser Gly Arg Arg Gly Ala Glu Ala Glu Ala Arg Ser Ser Thr His Gly
65                  70                  75                  80

Arg Glu Arg Ser Gln Ala Glu Pro Ser Glu Arg Arg Val Lys Arg Glu
                85                  90                  95

Lys Arg Asp Asp Gly Tyr Glu Ala Ala Ala Ser Ser Lys Thr Ser Ser
            100                 105                 110

Gly Asp Ala Ser Ser Leu Ser Ile Glu Glu Thr Asn Lys Leu Arg Ala
        115                 120                 125

Lys Leu Gly Leu Lys Pro Leu Glu Val Asn Ala Ile Lys Lys Glu Ala
    130                 135                 140

Gly Thr Lys Glu Glu Pro Val Thr Ala Asp Val Ile Asn Pro Met Ala
145                 150                 155                 160

Leu Arg Gln Arg Glu Glu Leu Arg Glu Lys Leu Ala Ala Ala Lys Glu
                165                 170                 175



Lys Arg Leu Leu Asn Gln Lys Leu Gly Lys Ile Lys Thr Leu Gly Glu
            180                 185                 190

Asp Asp Pro Trp Leu Asp Asp Thr Ala Ala Trp Ile Glu Arg Ser Arg
        195                 200                 205

Gln Leu Gln Lys Glu Lys Asp Leu Ala Glu Lys Arg Ala Lys Leu Leu
    210                 215                 220

Glu Glu Met Asp Gln Glu Phe Gly Val Ser Thr Leu Val Glu Glu Glu
225                 230                 235                 240

Phe Gly Gln Arg Arg Gln Asp Leu Tyr Ser Ala Arg Asp Leu Gln Gly
                245                 250                 255

Leu Thr Val Glu His Ala Ile Asp Ser Phe Arg Glu Gly Glu Thr Met
            260                 265                 270

Ile Leu Thr Leu Lys Asp Lys Gly Val Leu Gln Glu Glu Glu Asp Val
        275                 280                 285

Leu Val Asn Val Asn Leu Val Asp Lys Glu Arg Ala Glu Lys Asn Val
    290                 295                 300

Glu Leu Arg Lys Lys Lys Pro Asp Tyr Leu Pro Tyr Ala Glu Asp Glu
305                 310                 315                 320

Ser Val Asp Asp Leu Ala Gln Gln Lys Pro Arg Ser Ile Leu Ser Lys
                325                 330                 335

Tyr Asp Glu Glu Leu Glu Gly Glu Arg Pro His Ser Phe Arg Leu Glu
            340                 345                 350

Gln Gly Gly Thr Ala Asp Gly Leu Arg Glu Arg Glu Leu Glu Glu Ile
        355                 360                 365

Arg Ala Lys Leu Arg Leu Gln Ala Gln Ser Leu Ser Thr Val Gly Pro
    370                 375                 380

Arg Leu Ala Ser Glu Tyr Leu Thr Pro Glu Glu Met Val Thr Phe Lys
385                 390                 395                 400

Lys Thr Lys Arg Arg Val Lys Lys Ile Arg Lys Lys Glu Lys Glu Val
                405                 410                 415

Val Val Arg Ala Asp Asp Leu Leu Pro Leu Gly Asp Gln Thr Gln Asp
            420                 425                 430



Gly Asp Phe Gly Ser Arg Leu Arg Gly Arg Gly Arg Arg Arg Val Ser
        435                 440                 445

Glu Val Glu Glu Glu Lys Glu Pro Val Pro Gln Pro Leu Pro Ser Asp
    450                 455                 460

Asp Thr Arg Val Glu Asn Met Asp Ile Ser Asp Glu Glu Glu Gly Gly
465                 470                 475                 480

Ala Pro Pro Pro Gly Ser Pro Gln Val Leu Glu Glu Asp Glu Ala Glu
                485                 490                 495

Leu Glu Leu Gln Lys Gln Leu Glu Lys Gly Arg Arg Leu Arg Gln Leu
            500                 505                 510

Gln Gln Leu Gln Gln Leu Arg Asp Ser Gly Glu Lys Val Val Glu Ile
        515                 520                 525

Val Lys Lys Leu Glu Ser Arg Gln Arg Gly Trp Glu Glu Asp Glu Asp
    530                 535                 540

Pro Glu Arg Lys Gly Ala Ile Val Phe Asn Ala Thr Ser Glu Phe Cys
545                 550                 555                 560

Arg Thr Leu Gly Glu Ile Pro Thr Tyr Gly Leu Ala Gly Asn Arg Glu
                565                 570                 575

Glu Gln Glu Glu Leu Met Asp Phe Glu Arg Asp Glu Glu Arg Ser Ala
            580                 585                 590

Asn Gly Gly Ser Glu Ser Asp Gly Glu Glu Asn Ile Gly Trp Ser Thr
        595                 600                 605

Val Asn Leu Asp Glu Glu Lys Gln Gln Gln Asp Phe Ser Ala Ser Ser
    610                 615                 620

Thr Thr Ile Leu Asp Glu Glu Pro Ile Val Asn Arg Gly Leu Ala Ala
625                 630                 635                 640

Ala Leu Leu Leu Cys Gln Asn Lys Gly Leu Leu Glu Thr Thr Val Gln
                645                 650                 655

Lys Val Ala Arg Val Lys Ala Pro Asn Lys Ser Leu Pro Ser Ala Val
            660                 665                 670

Tyr Cys Ile Glu Asp Lys Met Ala Ile Asp Asp Lys Tyr Ser Arg Arg
        675                 680                 685



Glu Glu Tyr Arg Gly Phe Thr Gln Asp Phe Lys Glu Lys Asp Gly Tyr
    690                 695                 700

Lys Pro Asp Val Lys Ile Glu Tyr Val Asp Glu Thr Gly Arg Lys Leu
705                 710                 715                 720

Thr Pro Lys Glu Ala Phe Arg Gln Leu Ser His Arg Phe His Gly Lys
                725                 730                 735

Gly Ser Gly Lys Met Lys Thr Glu Arg Arg Met Lys Lys Leu Asp Glu
            740                 745                 750

Glu Ala Leu Leu Lys Lys Met Ser Ser Ser Asp Thr Pro Leu Gly Thr
        755                 760                 765

Val Ala Leu Leu Gln Glu Lys Gln Lys Ala Gln Lys Thr Pro Tyr Ile
    770                 775                 780

Val Leu Ser Gly Ser Gly Lys Ser Met Asn Ala Asn Thr Ile Thr Lys
785                 790                 795                 800







SEQUENCE LISTING 

SEQ ID NO: 1

SEQUENCE LENGTH: 800 amino acids
SEQUENCE TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: peptide
ORIGINAL SOURCE:

ORGANISM: human (Homo sapiens)
TISSUE TYPE: esophageal carcinoma tissue
FEATURE:

FEATURE KEY: peptide
LOCATION: 1..800
IDENTIFICATION METHOD: P
SEQUENCE DESCRIPTION:

Met Gly Ser Ser Lys Lys His Arg Gly Glu Lys Glu Ala Ala Gly Thr
1               5                   10                  15

Thr Ala Ala Ala Gly Thr Gly Gly Ala Thr Glu Gln Pro Pro Arg His
            20                  25                  30

Arg Glu His Lys Lys His Lys His Arg Ser Gly Gly Ser Gly Gly Ser
        35                  40                  45

Gly Gly Glu Arg Arg Lys Arg Ser Arg Glu Arg Gly Gly Glu Arg Gly
    50                  55                  60

Ser Gly Arg Arg Gly Ala Glu Ala Glu Ala Arg Ser Ser Thr His Gly
65                  70                  75                  80

Arg Glu Arg Ser Gln Ala Glu Pro Ser Glu Arg Arg Val Lys Arg Glu
                85                  90                  95

Lys Arg Asp Asp Gly Tyr Glu Ala Ala Ala Ser Ser Lys Thr Ser Ser
            100                 105                 110

Gly Asp Ala Ser Ser Leu Ser Ile Glu Glu Thr Asn Lys Leu Arg Ala
        115                 120                 125

Lys Leu Gly Leu Lys Pro Leu Glu Val Asn Ala Ile Lys Lys Glu Ala
    130                 135                 140

Gly Thr Lys Glu Glu Pro Val Thr Ala Asp Val Ile Asn Pro Met Ala
145                 150                 155                 160

Leu Arg Gln Arg Glu Glu Leu Arg Glu Lys Leu Ala Ala Ala Lys Glu
                165                 170                 175

Lys Arg Leu Leu Asn Gln Lys Leu Gly Lys Ile Lys Thr Leu Gly Glu
            180                 185                 190

Asp Asp Pro Trp Leu Asp Asp Thr Ala Ala Trp Ile Glu Arg Ser Arg
        195                 200                 205

Gln Leu Gln Lys Glu Lys Asp Leu Ala Glu Lys Arg Ala Lys Leu Leu
    210                 215                 220

Glu Glu Met Asp Gln Glu Phe Gly Val Ser Thr Leu Val Glu Glu Glu
225                 230                 235                 240

Phe Gly Gln Arg Arg Gln Asp Leu Tyr Ser Ala Arg Asp Leu Gln Gly
                245                 250                 255

Leu Thr Val Glu His Ala Ile Asp Ser Phe Arg Glu Gly Glu Thr Met
            260                 265                 270

Ile Leu Thr Leu Lys Asp Lys Gly Val Leu Gln Glu Glu Glu Asp Val
        275                 280                 285

Leu Val Asn Val Asn Leu Val Asp Lys Glu Arg Ala Glu Lys Asn Val
    290                 295                 300



Glu Leu Arg Lys Lys Lys Pro Asp Tyr Leu Pro Tyr Ala Glu Asp Glu
305                 310                 315                 320

Ser Val Asp Asp Leu Ala Gln Gln Lys Pro Arg Ser Ile Leu Ser Lys
                325                 330                 335

Tyr Asp Glu Glu Leu Glu Gly Glu Arg Pro His Ser Phe Arg Leu Glu
            340                 345                 350

Gln Gly Gly Thr Ala Asp Gly Leu Arg Glu Arg Glu Leu Glu Glu Ile
        355                 360                 365

Arg Ala Lys Leu Arg Leu Gln Ala Gln Ser Leu Ser Thr Val Gly Pro
    370                 375                 380

Arg Leu Ala Ser Glu Tyr Leu Thr Pro Glu Glu Met Val Thr Phe Lys
385                 390                 395                 400

Lys Thr Lys Arg Arg Val Lys Lys Ile Arg Lys Lys Glu Lys Glu Val
                405                 410                 415

Val Val Arg Ala Asp Asp Leu Leu Pro Leu Gly Asp Gln Thr Gln Asp
            420                 425                 430

Gly Asp Phe Gly Ser Arg Leu Arg Gly Arg Gly Arg Arg Arg Val Ser
        435                 440                 445

Glu Val Glu Glu Glu Lys Glu Pro Val Pro Gln Pro Leu Pro Ser Asp
    450                 455                 460

Asp Thr Arg Val Glu Asn Met Asp Ile Ser Asp Glu Glu Glu Gly Gly
465                 470                 475                 480

Ala Pro Pro Pro Gly Ser Pro Gln Val Leu Glu Glu Asp Glu Ala Glu
                485                 490                 495

Leu Glu Leu Gln Lys Gln Leu Glu Lys Gly Arg Arg Leu Arg Gln Leu
            500                 505                 510

Gln Gln Leu Gln Gln Leu Arg Asp Ser Gly Glu Lys Val Val Glu Ile
        515                 520                 525

Val Lys Lys Leu Glu Ser Arg Gln Arg Gly Trp Glu Glu Asp Glu Asp
    530                 535                 540

Pro Glu Arg Lys Gly Ala Ile Val Phe Asn Ala Thr Ser Glu Phe Cys
545                 550                 555                 560

Arg Thr Leu Gly Glu Ile Pro Thr Tyr Gly Leu Ala Gly Asn Arg Glu
                565                 570                 575

Glu Gln Glu Glu Leu Met Asp Phe Glu Arg Asp Glu Glu Arg Ser Ala
            580                 585                 590

Asn Gly Gly Ser Glu Ser Asp Gly Glu Glu Asn Ile Gly Trp Ser Thr
        595                 600                 605

Val Asn Leu Asp Glu Glu Lys Gln Gln Gln Asp Phe Ser Ala Ser Ser
    610                 615                 620

Thr Thr Ile Leu Asp Glu Glu Pro Ile Val Asn Arg Gly Leu Ala Ala
625                 630                 635                 640

Ala Leu Leu Leu Cys Gln Asn Lys Gly Leu Leu Glu Thr Thr Val Gln
                645                 650                 655

Lys Val Ala Arg Val Lys Ala Pro Asn Lys Ser Leu Pro Ser Ala Val
            660                 665                 670

Tyr Cys Ile Glu Asp Lys Met Ala Ile Asp Asp Lys Tyr Ser Arg Arg
        675                 680                 685

Glu Glu Tyr Arg Gly Phe Thr Gln Asp Phe Lys Glu Lys Asp Gly Tyr
    690                 695                 700

Lys Pro Asp Val Lys Ile Glu Tyr Val Asp Glu Thr Gly Arg Lys Leu
705                 710                 715                 720

Thr Pro Lys Glu Ala Phe Arg Gln Leu Ser His Arg Phe His Gly Lys
                725                 730                 735





Gly Ser Gly Lys Met Lys Thr Glu Arg Arg Met Lys Lys Leu Asp Glu
            740                 745                 750

Glu Ala Leu Leu Lys Lys Met Ser Ser Ser Asp Thr Pro Leu Gly Thr
        755                 760                 765

Val Ala Leu Leu Gln Glu Lys Gln Lys Ala Gln Lys Thr Pro Tyr Ile
    770                 775                 780

Val Leu Ser Gly Ser Gly Lys Ser Met Asn Ala Asn Thr Ile Thr Lys
785                 790                 795                 800

SEQ ID NO: 2

SEQUENCE LENGTH: 2527 base pairs
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULE TYPE: cDNA to mRNA
HYPOTHETICAL: No
ANTI-SENSE: No
ORIGINAL SOURCE:

ORGANISM: human (Homo sapiens)
TISSUE TYPE: esophageal carcinoma tissue
FEATURE:

FEATURE KEY: 5' UTR
LOCATION: 1..38
IDENTIFICATION METHOD: E

FEATURE KEY: CDS
LOCATION: 39..2438
IDENTIFICATION METHOD: E

FEATURE KEY: 3' UTR
LOCATION: 2439..2506
IDENTIFICATION METHOD: E

FEATURE KEY: poly A site
LOCATION: 2507..2527
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION:



GGTTCGGCGG CAGCCGGGCT CGGAGTGGAC GTGCCACTAT GGGGTCGTCC AAGAAGCATC 60
GCGGAGAGAA GGAGGCGGCC GGGACGACGG CGGCGGCCGG CACCGGGGGT GCCACCGAGC 120
AGCCGCCGCG GCACCGGGAA CACAAAAAAC ACAAGCACCG GAGTGGCGGC AGTGGCGGTA 180
GCGGTGGCGA ACGACGGAAG CGGAGCCGGG AACGTGGGGG CGAGCGCGGG AGCGGGCGGC 240
GCGGGGCCGA AGCTGAGGCG CGGAGCAGCA CGCACGGGCG GGAGCGCAGC CAGGCAGAGC 300
CCTCCGAGCG GCGCGTGAAG CGGGAGAAGC GCGATGACGG CTACGAGGCC GCTGCCAGCT 360
CCAAAACTAG CTCAGGCGAT GCCTCCTCAC TCAGCATCGA GGAGACTAAC AAACTCCGGG 420
CAAAGTTGGG GCTGAAACCC TTGGAGGTTA ATGCCATCAA GAAGGAGGCG GGCACCAAGG 480
AGGAGCCCGT GACAGCTGAT GTCATCAACC CTATGGCCTT GCGACAGCGA GAGGAGCTGC 540
GGGAGAAGCT GGCGGCTGCC AAGGAGAAGC GCCTGCTGAA CCAAAAGCTG GGGAAGATAA 600
AGACCCTAGG AGAGGATGAC CCCTGGCTGG ACGACACTGC AGCCTGGATC GAGAGGAGCC 660
GGCAGCTGCA GAAGGAGAAG GACCTGGCAG AGAAGAGGGC CAAGTTACTG GAGGAGATGG 720
ACCAAGAGTT TGGTGTCAGC ACTCTGGTGG AGGAGGAGTT CGGGCAGAGG CGGCAGGACC 780



TGTACAGTGC CCGGGACCTG CAGGGCCTCA CCGTGGAGCA TGCCATTGAT TCCTTCCGAG 840
AAGGGGAGAC AATGATTCTT ACCCTCAAGG ACAAAGGCGT GCTGCAGGAG GAGGAGGACG 900
TGCTGGTGAA CGTGAACCTG GTGGATAAGG AGCGGGCAGA GAAAAATGTG GAGCTGCGGA 960
AGAAGAAGCC TGACTACCTG CCCTATGCCG AGGACGAGAG CGTGGACGAC CTGGCGCAGC 1020
AAAAACCTCG CTCTATCCTG TCCAAGTATG ACGAAGAGCT TGAAGGGGAG CGGCCACATT 1080
CCTTCCGCTT GGAGCAGGGC GGCACGGCTG ATGGCCTGCG GGAGCGGGAG CTGGAGGAGA 1140
TCCGGGCCAA GCTGCGGCTG CAGGCTCAGT CCCTGAGCAC AGTGGGGCCC CGGCTGGCCT 1200
CCGAATACCT CACGCCTGAG GAGATGGTGA CCTTTAAAAA GACCAAGCGG AGGGTGAAGA 1260
AAATCCGCAA GAAGGAGAAG GAGGTAGTAG TGCGGGCAGA TGACTTGCTG CCTCTCGGGG 1320
ACCAGACTCA GGATGGGGAC TTTGGTTCCA GACTGCGGGG ACGGGGTCGC CGCCGAGTGT 1380
CCGAAGTGGA GGAGGAGAAG GAGCCTGTGC CTCAGCCCCT GCCGTCGGAC GACACCCGAG 1440
TGGAGAACAT GGACATCAGT GATGAGGAGG AAGGTGGAGC TCCACCGCCG GGGTCCCCGC 1500
AGGTGCTGGA GGAGGACGAG GCGGAGCTGG AGCTGCAGAA GCAGCTGGAG AAGGGACGCC 1560
GGCTGCGACA GTTACAGCAG CTACAGCAGC TGCGAGACAG TGGCGAGAAG GTGGTGGAGA 1620
TTGTGAAGAA GCTGGAGTCT CGCCAGCGGG GCTGGGAGGA GGATGAGGAT CCCGAGCGGA 1680
AGGGGGCCAT CGTGTTCAAC GCCACGTCCG AGTTCTGCCG CACCTTGGGG GAGATCCCCA 1740
CCTACGGGCT GGCTGGCAAT CGCGAGGAGC AGGAGGAGCT CATGGACTTT GAACGGGATG 1800
AGGAGCGCTC AGCCAACGGT GGCTCCGAAT CTGACGGGGA GGAGAACATC GGCTGGAGCA 1860
CGGTGAACCT GGACGAGGAG AAGCAGCAGC AGGATTTCTC TGCTTCCTCC ACCACCATCC 1920
TGGACGAGGA ACCGATCGTG AATAGGGGGC TGGCAGCTGC CCTGCTCCTG TGTCAGAACA 1980
AAGGGCTGCT GGAGACCACA GTGCAGAAGG TGGCCCGGGT GAAGGCCCCC AACAAGTCGC 2040
TGCCCTCAGC CGTGTACTGC ATCGAGGATA AGATGGCCAT CGATGACAAG TACAGCCGGA 2100
GGGAGGAATA CCGAGGCTTC ACACAGGACT TCAAGGAGAA GGACGGCTAC AAACCCGACG 2160
TTAAGATCGA ATACGTGGAT GAGACGGGCC GGAAACTCAC ACCCAAGGAG GCTTTCCGGC 2220
AGCTGTCGCA CCGCTTCCAT GGCAAGGGCT CAGGCAAGAT GAAGACAGAG CGGCGGATGA 2280
AGAAGCTGGA CGAGGAGGCG CTCCTGAAGA AGATGAGCTC CAGCGACACG CCCCTGGGCA 2340
CCGTGGCCCT GCTCCAGGAG AAGCAGAAGG CTCAGAAGAC CCCCTACATC GTGCTCAGCG 2400
GCAGCGGCAA GAGCATGAAC GCGAACACCA TCACCAAGTG ACAGCGCCCT CCCGTAGTCG 2460
GCCCTGCCTC AACCTTCATA TTAAATAAAG CTCCCTCCTT ATTTTTAAAA AAAAAAAAAA 2520
AAAAAAA 2527



